Discriminative Variable Subsets in Bayesian Classification with Mixture Models

نویسندگان

  • Lynn Lin
  • Cliburn Chan
  • Mike West
چکیده

We discuss the evaluation of subsets of variables for the discriminative evidence they provide in multivariate mixture modeling for classification. Novel development of Bayesian classification analysis uses a natural measure of concordance between mixture component densities, and defines an effective and computationally feasible method for assessing and prioritizing subsets of variables according to their roles in discrimination of one or more mixture components. We relate the new discriminative information measures to Bayesian classification probabilities and error rates, and exemplify their use in Bayesian analysis of Dirichlet process mixture models fitted via Markov chain Monte Carlo methods as well as using a novel Bayesian expectation-maximization algorithm. We present a series of theoretical and simulated data examples to fix concepts and exhibit the utility of the approach, and compare with prior approaches. We demonstrate application in the context of automatic classification and discriminative variable selection in high-throughput systems biology using large flow cytometry data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discriminative variable subsets in Bayesian classification with mixture models, with application in flow cytometry studies

We discuss the evaluation of subsets of variables for the discriminative evidence they provide in multivariate mixture modeling for classification. The novel development of Bayesian classification analysis presented is partly motivated by problems of design and selection of variables in biomolecular studies, particularly involving widely used assays of large-scale single-cell data generated usi...

متن کامل

Efficient Discriminative Learning of Mixture of Bayesian Networks for Sequence Classification

Recently, it has been shown that Bayesian Network Classifier (BNC), a generative model applied to a classification task, yields comparable performance to sophisticated discriminative methods such as SVMs and C4.5 [1]. Improved classification performance of BNCs is due, in part, to optimization of the conditional likelihood (CML) [2]. Unfortunately, the CML optimization, often implemented as a g...

متن کامل

Discriminative Learning of Generative Models for Sequence Classification and Motion Tracking

I consider the issue of learning generative probabilistic models (e.g., Bayesian Networks) for the problems of classification and regression. As the generative models now serve as target-predicting functions, the learning problem can be treated differently from the traditional density estimation. Unlike the likelihood maximizing generative learning that fits a model to overall data, the discrim...

متن کامل

Large Margin Learning of Bayesian Classifiers Based on Gaussian Mixture Models

Abstract. We present a discriminative learning framework for Gaussian mixture models (GMMs) used for classification based on the extended Baum-Welch (EBW) algorithm [1]. We suggest two criteria for discriminative optimization, namely the class conditional likelihood (CL) and the maximization of the margin (MM). In the experiments, we present results for synthetic data, broad phonetic classifica...

متن کامل

Broad phonetic classification using discriminative Bayesian networks

We present an approach to broad phonetic classification, defined as mapping acoustic speech frames into broad (or clustered) phonetic categories. Our categories consist of silence, general voiced, general unvoiced, mixed sounds, voiced closure, and plosive release, and are sufficiently rich to allow accurate time-scaling of speech signals to improve their intelligibility in, e.g. voice-mail app...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013